Bus encoding using metadata

ABSTRACT

Various additional and alternative aspects are described herein. In some aspects, the present disclosure provides a method of communicating data between an electronic unit (EU) of a system-on-chip (SoC) and a dynamic random access memory (DRAM). The method includes compressing data at the EU. The method further includes encoding the compressed data at the EU. The method further includes generating control data for decoding the encoded data at the EU. The method further includes generating metadata corresponding to the compressed data at the EU. The metadata further includes the control data. The method further includes directing storage of the encoded data and the metadata in the DRAM.

TECHNICAL FIELD

The teachings of the present disclosure relate generally to transmission of data over buses and in particular embodiments to bus encoding using metadata in system-on-chip (SoC) applications.

INTRODUCTION

An integrated circuit (e.g., SoC) may utilize on chip bus interconnections between electronic units (EU), which reside on the integrated circuit. Accordingly, EUs on integrated circuits may be coupled to each other by buses. A “bus” may include a signal conductor or a plurality of conductors used to couple EUs and transfer data (e.g., bits) between EUs. An “electronic unit” may include electronic circuitry, which is coupled to other electronic circuitry by one or more buses. For example, an EU may be a microprocessor, a computer, a floating point co-processor, a graphics processing unit (GPU), an intellectual property (IP) core, processing unit, memory, controller, etc. An EU may be coupled to another EU by a data bus.

A SoC may include a plurality of EUs coupled by one or more buses. Such interconnected systems may resemble a quilt with different fabric pieces interconnected by stitches, hence a common term to describe such an interconnected system is “fabric.”

Reduction of power consumption is a common goal for electronic systems. For example, transmission of data across buses between components of the electronic system may consume power. Reducing such power consumed in the transmission of data across buses may therefore reduce power consumption overall in the electronic system. For example, reducing the power utilized for transmission of data over buses in system-on-chip (SoC) applications, may reduce the overall power consumed by the SoC.

BRIEF SUMMARY OF SOME EXAMPLES

The following presents a simplified summary of one or more aspects of the present disclosure, in order to provide a basic understanding of such aspects. This summary is not an extensive overview of all contemplated features of the disclosure, and is intended neither to identify key or critical elements of all aspects of the disclosure nor to delineate the scope of any or all aspects of the disclosure. Its sole purpose is to present some concepts of one or more aspects of the disclosure in a simplified form as a prelude to the more detailed description that is presented later.

In some aspects, the present disclosure provides a method of communicating data between an electronic unit (EU) of a system-on-chip (SoC) and a dynamic random access memory (DRAM). The method includes compressing data at the EU. The method further includes encoding the compressed data at the EU. The method further includes generating control data for decoding the encoded data at the EU. The method further includes generating metadata corresponding to the compressed data at the EU. The metadata further includes the control data. The method further includes directing storage of the encoded data and the metadata in the DRAM.

In some aspects, the present disclosure provides an electronic unit (EU) of a system-on-chip (SoC). The EU includes a bandwidth compression component configured to compress data. The EU further includes a bus encoding component configured to encode the compressed data and generate control data for decoding the encoded data. The EU further includes a bus protocol conversion component configured to generate metadata corresponding to the compressed data. The metadata further includes the control data. The bus protocol conversion component is further configured to direct storage of the encoded data and the metadata in a dynamic random access memory (DRAM).

In some aspects, the present disclosure provides an electronic unit (EU) of a system-on-chip (SoC). The EU includes means for compressing data at the EU. The EU further includes means for encoding the compressed data at the EU. The EU further includes means for generating control data for decoding the encoded data at the EU. The EU further includes means for generating metadata corresponding to the compressed data at the EU. The metadata further includes the control data. The EU further includes means for directing storage of the encoded data and the metadata in the DRAM.

In some aspects, the present disclosure provides a computer readable medium having instructions stored thereon for causing an electronic unit (EU) of a system-on-chip (SoC) to perform a method of communicating data between the EU and a dynamic random access memory (DRAM). The method includes compressing data at the EU. The method further includes encoding the compressed data at the EU. The method further includes generating control data for decoding the encoded data at the EU. The method further includes generating metadata corresponding to the compressed data at the EU. The metadata further includes the control data. The method further includes directing storage of the encoded data and the metadata in the DRAM.

These and other aspects of the invention will become more fully understood upon a review of the detailed description, which follows. Other aspects, features, and embodiments of the present invention will become apparent to those of ordinary skill in the art, upon reviewing the following description of specific, exemplary embodiments of the present invention in conjunction with the accompanying figures. While features of the present invention may be discussed relative to certain embodiments and figures below, all embodiments of the present invention can include one or more of the advantageous features discussed herein. In other words, while one or more embodiments may be discussed as having certain advantageous features, one or more of such features may also be used in accordance with the various embodiments of the invention discussed herein. In similar fashion, while exemplary embodiments may be discussed below as device, system, or method embodiments it should be understood that such exemplary embodiments can be implemented in various devices, systems, and methods.

BRIEF DESCRIPTION OF THE DRAWINGS

So that the manner in which the above-recited features of the present disclosure can be understood in detail, a more particular description, briefly summarized above, may be had by reference to aspects, some of which are illustrated in the appended drawings. It is to be noted, however, that the appended drawings illustrate only certain typical aspects of this disclosure and are therefore not to be considered limiting of its scope, for the description may admit to other equally effective aspects.

FIG. 1 is an illustration of an exemplary integrated circuit employing a system-on-chip (SoC) interconnection fabric, in accordance with certain aspects of the present disclosure.

FIG. 2 is an illustration of serial and parallel bus configurations, in accordance with certain aspects of the present disclosure.

FIG. 3 illustrates a system including a memory system for communicating data between an electronic unit (EU) (e.g., of a system-on-chip (SoC)) and a physical memory, in accordance with certain aspects of the present disclosure.

FIG. 3A illustrates a block diagram of an example of an EU of FIG. 3, in accordance with certain aspects of the present disclosure.

FIG. 3B illustrates a block diagram of an example of an EU of FIG. 3 configured to place control data related to encoding of compressed data in the metadata corresponding to the compressed data, in accordance with certain aspects of the present disclosure.

FIG. 4 illustrates example operations for communicating data between an electronic unit (EU) (e.g., of a system-on-chip (SoC)) and a physical memory, in accordance with certain aspects of the present disclosure.

FIG. 5 illustrates example operations for communicating data between an electronic unit (EU) (e.g., of a system-on-chip (SoC)) and a physical memory, in accordance with certain aspects of the present disclosure.

DETAILED DESCRIPTION

The detailed description set forth below in connection with the appended drawings is intended as a description of various configurations and is not intended to represent the only configurations in which the concepts described herein may be practiced. The detailed description includes specific details for the purpose of providing a thorough understanding of various concepts. However, it will be apparent to those skilled in the art that these concepts may be practiced without these specific details. In some instances, well known structures and components are shown in block diagram form in order to avoid obscuring such concepts.

Although the teachings of this disclosure are illustrated in terms of integrated circuits (e.g., a SoC), the teachings are applicable in other areas. The teachings disclosed should not be construed to be limited to SoC designs or the illustrated embodiments. The illustrated embodiments are merely vehicles to describe and illustrate examples of the inventive teachings disclosed herein.

FIG. 1 is an illustration of an exemplary integrated circuit 100 employing a SoC interconnection fabric 143, in accordance with certain aspects of the present disclosure. The illustrative fabric 143 includes EUs 101-113 coupled by buses 123-133.

Buses 123-133 transmit and receive data in discrete chunks comprising one or more bits between EUs 101-113. A bus transaction may communicate successive chunks of data in sequence. Each chunk of data in a bus transaction may be referred to as a beat. A beat is the smallest indivisible transmission of data across a bus. Each beat may transfer one or more bits of information, depending on the bus configuration.

In some aspects, EUs may be coupled together by separate buses or shared buses.

FIG. 2 is an illustration of serial and parallel bus configurations, in accordance with certain aspects of the present disclosure. For example, bus 209 illustrates a parallel bus configuration between EUs 201 and 203. Bus 209 as shown includes N conductors. In some aspects, each conductor can communicate one bit per beat. Accordingly, the N conductors between EUs 201 and 203 can communicate N bits per beat (e.g., a N bit word). In some aspects, though not shown, the bus 209 may include additional conductors between EUs 201 and 203 to communicate other information (e.g., control information, metadata, etc.).

Bus 211 illustrates a serial configuration between EUs 205 and 207. Bus 211 as shown includes a single conductor. Accordingly, the conductor between EUs 205 and 207 can communicate one bit per beat. Data transmitted across such serial buses commonly comprises a plurality of beats where bits are transmitted serially. In some aspects, though not shown, the bus 211 may include additional conductors between EUs 205 and 207 to communicate other information (e.g., control information, metadata, etc.).

The teachings herein may apply to both parallel and serial bus configurations, as well as variations thereof. For the sake of clarity, and to avoid unnecessary obfuscations of the teachings herein, common structures have been omitted. Throughout this disclosure common and well known structures not necessary for the understanding of the teachings herein are also omitted so that the inventive teachings herein are not obscured.

FIG. 3 illustrates a system 300 including a memory system 304 (e.g., dynamic random access memory (DRAM) system, such as a double data rate (DDR) DRAM system), in accordance with certain aspects of the present disclosure. The system 300 may be implemented in any computing device, including a personal computer, a workstation, a server, a portable computing device (PCD), such as a cellular telephone, a portable digital assistant (PDA), a portable game console, a palmtop computer, or a tablet computer. One or more components of the system 300 may be implemented on an integrated circuit (e.g., SoC).

As shown, the system 300 comprises various on-chip components, including one or more EUs 306 that may access (e.g., read/write) memory resources of the memory system 304. The EUs 306, as discussed, may comprise one or more processing units (e.g., central processing unit (CPU), graphics processing unit (GPU), digital signal processor (DSP), display processor, etc.), a video encoder, an intellectual property (IP) core, or other clients requesting read/write access to the memory system 304. The EUs 306 may be coupled to each other via one or more buses. Further, one or more of the EUs 306 are coupled to a system memory management unit (SMMU) 308 via a bus 305. For example, one or more EUs 306 may share a bus 305 to communicate with the SMMU 308, or some EUs 306 may be coupled via separate buses to the SMMU 308. Further, in some aspects, one or more EUs 306 may share the bus 305 to communicate with each other.

The SMMU 308 is coupled to memory controller 320 (e.g., DRAM controller, DDR controller, etc.). The SMMU 308 may be coupled to the memory controller 320 by the bus 330, which may be different than or the same as bus 305. The memory controller 320 may further be coupled to the physical memory 324 (e.g., DRAM, DDR RAM, etc.). The memory controller 320 may be coupled to the physical memory 324 by the bus 332, which may be different than or the same as bus 305 or bus 330.

EUs 306 may access the physical memory 324 via the SMMU 308 and the memory controller 320. For example, the SMMU 308 may provide a virtual memory to the EU 308 that allows the EU 308 to access a contiguous virtual memory address space for each process (e.g., software program, function, etc.) running on the EU 308 instead of directly accessing a physical memory address space, which may be fragmented or disjointed. For example, an EU 306 may generate a read/write command and send the command as a bus transaction to the SMMU 308 on the bus 305.

The SMMU 308 may translate the virtual memory address (VA) identified in the bus transaction received from the EU 306 and translate the VA to a physical memory address (PA) of the physical memory 324. The SMMU 308 may send a modified bus transaction corresponding to the bus transaction from the EU 306 to the memory controller 320 on the bus 330. The modified bus transaction may include the PA of the physical memory 324 instead of the VA of the original bus transaction from the EU 306. The memory controller 320 may handle the physical signaling and communication with the physical memory 324 to read/write data according to the modified bus transaction. For example, the memory controller 320 may contain the logic needed to read and write to the physical memory 324 and refresh the physical memory 324. The memory controller 320 may communicate with the physical memory 324 on the bus 332 to read/write data according to the bus transaction received from the SMMU 308.

The physical memory 324 may further send data (e.g., based on a read command) to the memory controller 320, which may send data to the SMMU 308, which may further send data to the EU 306.

FIG. 3A illustrates a block diagram of an example of an EU 306 (e.g., GPU), in accordance with certain aspects of the present disclosure. The EU 306 may be configured to utilize bandwidth compression/decompression (BWC/D) to reduce the bandwidth needed to transmit data on the buses between components of the system 300. For example, the EU 306, as shown, includes a BWC/D component 352 configured to apply bandwidth compression to data (e.g., image data) to reduce the bandwidth (e.g., size) of the data. The BWC/D component 352 may further be configured to decompress compressed data. The BWC/D component 352 may be implemented in hardware or software of the EU 306. Further, though the BWC/D component 352 is shown as a single element, in some aspects, the BWC/D component 352 may comprise separate components for compression and decompression (e.g., a bandwidth compression (BWC) component and a bandwidth decompression (BWD) component, respectively). Further, in some aspects, an EU 306 may include only one of the BWC and BWD components. By reducing the size of the data communicated over the buses and stored in memory, the power consumption of the system 300 may be reduced. For example, the bus power and memory power required to perform data writes may be reduced as fewer bits are transferred on the bus and written to the memory. Further, since the data stored in the memory is compressed, the number of bits read from the memory may also be reduced, therefore reducing the power required for a data read.

For example, the EU 306 may have data to write to the physical memory 324. The BWC/D component 352 may apply a compression technique (e.g., tile based compression scheme for image data divided into tiles) to the data to write to the physical memory 324. The BWC/D component 352 may pass the compressed data to a bus protocol conversion component 354. The bus protocol conversion component 354 may be implemented in hardware or software of the EU 306. The bus protocol conversion component 354 is configured to convert the compressed data to a bus transaction and send the data as a bus transaction to the SMMU 308 for further communication to the physical memory 324 as described herein. In some aspects, where the data comprises a tile of image data, the writing of the compressed tile may correspond to a single bus transaction to the SMMU 308.

In addition to transmitting the actual compressed data to the SMMU 308, the bus protocol conversion component 354 may communicate the address information (e.g., VA) of where to write the compressed data to the SMMU 308 over the bus 305. Further, the bus protocol conversion component 354 may send metadata about the compressed data to the SMMU 308 along with address information (e.g., VA) of where to write the metadata to the SMMU 308 over the bus 305. In some aspects, the bus protocol conversion component 354 may send the metadata to the SMMU 308 in a different bus transaction than the compressed data. The metadata my include information used for decompressing the compressed data. For example, the metadata may include an indication of the type of compression applied, a size of the data before compression, a size of the compressed data, header information, etc.

In some aspects, the same or another EU 306 may read the compressed data from the physical memory 324. In particular, as discussed, the bus protocol conversion component 354 may generate a bus transaction corresponding to a read command of compressed data (e.g., a single bus transaction to read a single compressed tile from the physical memory 324). The bus transaction may further indicate a VA of the compressed tile. The bus protocol conversion component 354 may send the bus transaction to the SMMU 308 over the bus 305. The bus protocol conversion component 354 may further generate a separate bus transaction corresponding to a read command of the metadata associated with the compressed data. The bus transaction may further indicate a VA of the metadata. The bus protocol conversion component 354 may send the bus transaction to the SMMU 308 over the bus 305

The SMMU 308 may further communicate over the bus 330 with the memory controller 320, which may communicate over the bus 332 with the physical memory 324. The memory controller 320 may receive the compressed data and corresponding metadata from the physical memory 324 over bus 332 (e.g., as separate bus transactions). The memory controller 320 may send the compressed data and corresponding metadata to the SMMU 308 over bus 330 (e.g., as separate bus transactions). The SMMU 308 may send the compressed data and corresponding metadata to the EU 306 over bus 305 (e.g., as separate bus transactions). The bus protocol conversion component 354 may convert the bus transaction(s) to the compressed data and metadata and pass the compressed data and metadata to the BWC/D component 352. The BWC/D component 352 may decompress the compressed data based on the corresponding metadata to retrieve the original data (e.g., tile data).

In some aspects, the EU 306, in addition to compressing data for transmission over buses 305, 330, and 332 and storage in the physical memory 324, may encode the compressed data to further reduce power consumption for communicating the data over the buses 305, 330, and 332.

In some aspects, the EU 306 may encode the compressed data to reduce a number of state transitions between bits of the compressed data transferred along the buses 305, 330, and 332. Changing the state of a data bit on a particular bus conductor may consume more power than transmitting the same bit successively. Accordingly, reducing the number of state transitions between successively communicated bits along the buses 305, 330, and 332 reduces power consumption. Therefore, in some aspects, the EU 306 encodes the compressed data so that the transitions of the data bits in successive beats of the bus transactions along the buses 305, 330, and 332 exhibit the minimum cumulative Hamming distance between all the successive data bits of the bus transactions (e.g., corresponding to the compressed data, metadata, etc.).

For example, when the same data bit value is transmitted twice, the value of that data bit is maintained and power is not lost switching between states. Simply put, for the sake of illustration, transmitting two successive “1” values over a bus takes less power than transmitting a “1” value followed by a “0” value because to transmit a “1” followed by a “0” consumes power in switching states from a “1” to a “0”. The total number of transitions on each data bus conductor, added up over all the conductors in the bus and all the data transmitted, may be designated as the Hamming distance of the data. The smallest number for the Hamming distance of the data to be transmitted may generally result in the fewest bus transitions and the least power used.

In some aspects, the Hamming distance of a transmission may not be the only factor considered for encoding the compressed data. A value of one state on a bus may consume more power than the value of another state on a bus. As an example, assume that a one-bit serial bus is transmitting an 8 beat message containing 8 bits. Further assume that maintaining a “1” state consumes 20% more power than maintaining a “0” state, for example due to leakage. Transmitting a data sequence of 1,0,0,0,0,0,0,0 will consume less power than transmitting 0,1,1,1,1,1,1,1 even though the Hamming distance is the same for both sequences. Consideration of the power to maintain a particular state on a bus may be referred to as weighting or weighing. Accordingly, in some aspects, the EU 306 may encode the compressed data based on the weighting of the data communicated, in addition or in alternative to encoding the compressed data based on the Hamming distance.

For example, the EU 306 may use weighted bus invert coding to encode the compressed data. In weighted bus invert coding an extra bit (e.g., bus invert bit) may be used to indicate if the data on the bus has been inverted or not. For example, 1,0,0,0,0,0,0,0 transmitted in a first beat followed by 0,1,1,1,1,1,1,1 transmitted in a second beat, could be transmitted as 1,0,0,0,0,0,0,0 (bus invert=0, i.e. not inverted) followed by 1,0,0,0,0,0,0,0 (bus invert=1, i.e. inverted). In such a case, only one bit (the bus invert bit) would switch instead of having 8 bits switch state. Further, in weighted bus invert coding the fact that a 1 state consumes more power than a 0 state would be taken into account and the pattern 1,0,0,0,0,0,0,0 would be preferred over the pattern 0,1,1,1,1,1,1,1, which will consume more energy in transmission due to the weighting of the power in the different states.

In some aspects, to implement the encoding of the compressed data (e.g., based on the Hamming distance, weighting, or other power reduction schemes), the components of the system 300 may need to pass additional control data (e.g., control information) for interpreting (e.g., decoding) the encoded compressed data. For example, the control data may indicate whether bus encoding has been applied on the transaction, whether encoding has been applied to a particular beat of the bus transaction, the type of encoding applied, reordering of the words in the transaction, etc.

In some aspects, such control data cannot be passed as part of the bus transaction, but instead needs to be passed as sideband signaling between the components of the system 300 (e.g., the EU 306, the SMMU 308, and the memory controller 320) along the path the compressed data and metadata is communicated. Such sideband signaling may occur over additional conductors between each of the components of the system 300. Such additional conductors may utilize additional die space for implementing the system 300, and further utilize additional power consumption for passing the sideband signals.

In addition, the physical memory 324 may not be configured to utilize the control data or store the control data since it is not configured to handle sideband signaling. Accordingly, the memory controller 320 may decode the encoded data and then store only the compressed data (i.e., not encoded) in the physical memory 324. Accordingly, when a read operation of the compressed data is performed, the compressed data passed between the components of the system 300 (e.g., physical memory 324, memory controller 320, SMMU 308, EU 306, etc.) may not be encoded and therefore may not have reduced power consumption for communicating the data between the components along the buses 305, 330, and 332.

Compression of data may be different from encoding of data as described herein. In particular, compression of data may refer to reducing the number of bits utilized to represent the data, but does not specifically account for how the particular arrangement or value of bits would affect power consumption when transferred over a bus or stored in memory. Encoding of data may refer to substituting a bit pattern for another bit pattern, even if the substitute bit pattern has the same, fewer, or a greater number of bits than the original bit pattern. In particular, encoding (e.g., based on the Hamming distance, weighting, or other power reduction schemes) of data, as discussed herein, reduces power for communicating the data over a bus. Such encoding may be referred to as bus encoding.

Certain aspects of the disclosure provide systems and methods for implementing bus encoding using metadata. In particular, as discussed above, data (e.g., tile data) to be communicated over a bus may be compressed and metadata for decompressing the compressed data may be generated by an EU. The compressed data and metadata may be communicated and stored by an EU to a physical memory, as discussed. Further, the EU may apply encoding to the compressed data (e.g., and metadata) communicated and stored. In certain aspects, control data regarding how the compressed data is encoded may be transmitted along with the compressed data for the compressed data to be properly decoded. Accordingly, in certain aspects, systems and methods herein relate to placing control data related to the encoding in the metadata corresponding to the compressed data.

By placing control data related to the encoding in the metadata corresponding to the compressed data, the control data may be communicated over the buses between components of a system instead of utilizing additional sideband signaling. Accordingly, additional conductors are not needed for such sideband signaling. Further, since the control data is part of the metadata, and the metadata is written to the physical memory, the encoded data can be written to the physical memory, instead of a memory controller having to decode the data and write decoded data to the physical memory. Communicating encoded data between the memory controller and the physical memory can reduce the power consumption for communicating the data between the memory controller and the physical memory. Further, writing encoded data to the physical memory may reduce a number of toggles of the physical memory for storing data in the memory, and therefore reduce the power consumption for storing the data in the physical memory. In addition, when the data is read from the physical memory, it is still encoded and therefore the power consumption for communicating the encoded data in a read transaction between components of the system is also reduced.

FIG. 3B illustrates a block diagram of an example of an EU 306 (e.g., GPU) configured to place control data related to encoding of compressed data in the metadata corresponding to the compressed data, in accordance with certain aspects of the present disclosure. In other words, the metadata includes the control data. As shown, the EU 306 of FIG. 3B is similar to the EU 306 of FIG. 3A. However, the EU 306, in addition to the BWC/D component 352 and the bus protocol conversion component 354, further includes a bus coding component 360. The bus coding component 360 may be configured to apply bus encoding to data (e.g., compressed data) from the BWC/D component 352. The bus coding component 360 may further be configured to decode encoded data (e.g., compressed data). Though the bus coding component 360 is shown as a single element, in some aspects, the bus coding component 360 may comprise separate components for encoding and decoding (e.g., a bus encoding component and a bus decoding component). Further, in some aspects, an EU 306 may include only one of the bus encoding component and the bus decoding component. The bus coding component 360 may then pass the encoded compressed data to the bus protocol conversion component 354 for further communication with each of the SMMU 308, memory controller 320, and physical memory 324 over buses 305, 330, and 332, respectively. The bus coding component 360 may further be configured to decode encoded data received as part of a read transaction. In some aspects, each of the SMMU 308 and memory controller 320 may further be configured to read the control data placed in the metadata to interpret the encoded compressed data.

For example, the EU 306 may have data to write to the physical memory 324. The BWC/D component 352 may apply a compression technique (e.g., tile based compression scheme for image data divided into tiles) to the data to write to the physical memory 324. The BWC/D component 352 may pass the compressed data to the bus coding component 360. The bus coding component 360 may encode the compressed data (e.g., utilizing bus encoding, such as described herein). The bus coding component 360 may further generate control data for interpreting/decoding the encoded compressed data. The bus coding component 360 further passes the control data and encoded compressed data to the bus protocol conversion component 354.

The bus protocol conversion component 354 is configured to convert the encoded compressed data to a bus transaction and send the data as a bus transaction to the SMMU 308 for further communication to the physical memory 324 as described herein. In addition to transmitting the actual encoded compressed data to the SMMU 308, the bus protocol conversion component 354 may communicate the address information (e.g., VA) of where to write the encoded compressed data to the SMMU 308 over the bus 305.

Further, the bus protocol conversion component 354 may generate and send metadata about the encoded compressed data to the SMMU 308 along with address information (e.g., VA) of where to write the metadata to the SMMU 308 over the bus 305. The bus protocol conversion component 354 includes the control data for decoding the encoded compressed data received from the bus encoding component 360 in the metadata generated and sent to the SMMU 308.

In some aspects, the metadata may include an indication of the size of the compressed data. Further, the control data included in the metadata may include an indication of which beats of the bus transaction corresponding to the compressed data are encoded or not. For example, the metadata may comprise 8 bits. In some aspects, the 8 bits are defined as follows: 1 bit indicating the size of the compressed data (e.g., is it compressed or not), and 7 bits indicating whether or not each of the last 7 beats of the transaction are encoded or not. Additionally or alternatively, the metadata may include an indication of the type of compression applied, a size of the data before compression, a size of the compressed data, header information, etc. Additionally or alternatively, the control data may include an indication of whether bus inversion has been used for encoding, whether encoding has been applied to a particular beat of the bus transaction, a type of encoding applied, etc. In some aspects, the bus protocol conversion component 354 may send the metadata to the SMMU 308 in a different bus transaction than the compressed data over the bus 305. The metadata my include information used for decompressing the compressed data.

In some aspects, the SMMU 308 may utilize the control data in the metadata to interpret the encoded compressed data. The SMMU 308 may further translate the VA for storing the encoded compressed data to a PA in the physical memory 324. The SMMU 308 may further communicate the encoded compressed data and the metadata to the memory controller 320 over the bus 330. In some aspects, the memory controller 320 may utilize the control data in the metadata to interpret the encoded compressed data. The memory controller 320 may communicate the encoded compressed data and metadata to the physical memory 324 and direct storage of the encoded compressed data and metadata in the physical memory 324.

In some aspects, the same or another EU 306 may read the encoded compressed data from the physical memory 324. In particular, as discussed, the bus protocol conversion component 354 may generate a bus transaction corresponding to a read command of the encoded compressed data (e.g., a single bus transaction to read a single compressed tile from the physical memory 324). The bus transaction may further indicate a VA of the encoded compressed tile. The bus protocol conversion component 354 may send the bus transaction to the SMMU 308 over the bus 305. The bus protocol conversion component 354 may further generate a separate bus transaction corresponding to a read command of the metadata associated with the encoded compressed data. The bus transaction may further indicate a VA of the metadata. The bus protocol conversion component 354 may send the bus transaction to the SMMU 308 over the bus 305

The SMMU 308 may further communicate over the bus 330 with the memory controller 320, which may communicate over the bus 332 with the physical memory 324. The memory controller 320 may receive the encoded compressed data and corresponding metadata from the physical memory 324 over bus 332 (e.g., as separate bus transactions). The memory controller 320 may send the encoded compressed data and corresponding metadata to the SMMU 308 over bus 330 (e.g., as separate bus transactions). The SMMU 308 may send the encoded compressed data and corresponding metadata to the EU 306 over bus 305 (e.g., as separate bus transactions). The bus protocol conversion component 354 may convert the bus transaction(s) to the encoded compressed data and metadata and pass the encoded compressed data and metadata to the bus coding component 360. The bus coding component 360 may decode the encoded compressed data based on the control data included in the metadata. The bus coding component 360 may pass the compressed data and metadata to the BWC/D component 352. The BWC/D component 352 may decompress the compressed data based on the corresponding metadata to retrieve the original data (e.g., tile data).

FIG. 4 illustrates example operations for communicating data between an electronic unit (EU) (e.g., of a system-on-chip (SoC)) and a physical memory, in accordance with certain aspects of the present disclosure.

At 405, the EU (e.g., EU 306) compresses data (e.g., image data). For example, a BWC/D component (e.g., BWC/D component 352) of the EU may apply bandwidth compression to the data to form image data. In some aspects, the BWC/D component may also generate metadata corresponding to the compressed data.

At 410, the EU encodes the compressed data. For example, a bus coding component (e.g., bus coding component 360) of the EU applies a bus encoding to reduce a power for transmitting the compressed data along one or more buses. In some aspects, the bus encoding component of the EU further generates control data corresponding to the encoded compressed data.

At 415, the EU directs storage of the encoded data and the metadata in a physical memory (e.g., physical memory 324). For example, a bus protocol conversion component (e.g., bus protocol conversion component 354) may generate bus transactions for writing the encoded compressed data and metadata in the physical memory. In some aspects, the bus protocol conversion component includes generates the metadata and includes the control data in the metadata. The bus protocol conversion component may send the bus transactions over a bus to a SMMU (e.g., SMMU 308). Further, the SMMU generates and sends bus transactions for writing the encoded compressed data and metadata to a memory controller (e.g., memory controller 320). The memory controller generates and sends bus transactions for writing the encoded compressed data and metadata to the physical memory. The physical memory stores the encoded compressed data and metadata.

FIG. 5 illustrates example operations for communicating data between an electronic unit (EU) (e.g., of a system-on-chip (SoC)) and a physical memory, in accordance with certain aspects of the present disclosure.

At 505, the EU (e.g., EU 306) generates a read transaction to read encoded compressed data from a physical memory (e.g., physical memory 324). For example, a bus protocol conversion component (e.g., bus protocol conversion component 354) of the EU may generate bus transactions for reading the encoded compressed data and corresponding metadata from the physical memory.

At 510, the EU communicates the bus transactions over a bus to the physical memory. For example, the bus protocol conversion component may send the bus transactions over a bus to a SMMU (e.g., SMMU 308). Further, the SMMU generates and sends bus transactions for reading the encoded compressed data and metadata to a memory controller (e.g., memory controller 320). The memory controller generates and sends bus transactions for reading the encoded compressed data and metadata to the physical memory. The physical memory retrieves the stored encoded compressed data and metadata.

At 515, the EU receives the encoded compressed data from the physical memory. For example, the memory controller receives the encoded compressed data and metadata from the physical memory. The memory controller generates and sends bus transactions including the compressed data and metadata to the SMMU. The SMMU generates and sends bus transactions including the encoded compressed data and metadata to the bus protocol conversion component.

At 520, the EU decodes the encoded compressed data based on control data included in the metadata corresponding to the encoded compressed data. For example, the bus protocol conversion component passes the encoded compressed data and metadata to a bus coding component (e.g., bus coding component 360) of the EU. The bus coding component decodes the encoded compressed data based on control data included in the metadata.

At 525, the EU decompresses the compressed data based on the metadata. For example, the bus coding component passes the decoded compressed data and metadata to a BWC/D component (e.g., BWC/D component 352) of the EU. The BWC/D component may decompress the compressed data based on the metadata to retrieve the uncompressed data.

In some configurations, the term(s) ‘communicate,’ ‘communicating,’ and/or ‘communication’ may refer to ‘receive,’ ‘receiving,’ ‘reception,’ and/or other related or suitable aspects without necessarily deviating from the scope of the present disclosure. In some configurations, the term(s) ‘communicate,’ ‘communicating,’ ‘communication,’ may refer to ‘transmit,’ ‘transmitting,’ ‘transmission,’ and/or other related or suitable aspects without necessarily deviating from the scope of the present disclosure.

Within the present disclosure, the word “exemplary” is used to mean “serving as an example, instance, or illustration.” Any implementation or aspect described herein as “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects of the disclosure. Likewise, the term “aspects” does not require that all aspects of the disclosure include the discussed feature, advantage or mode of operation. The term “coupled” is used herein to refer to the direct or indirect coupling between two objects. For example, if object A physically touches object B, and object B touches object C, then objects A and C may still be considered coupled to one another-even if they do not directly physically touch each other. For instance, a first object may be coupled to a second object even though the first object is never directly physically in contact with the second object. The terms “circuit” and “circuitry” are used broadly, and intended to include both hardware implementations of electrical devices and conductors that, when connected and configured, enable the performance of the functions described in the present disclosure, without limitation as to the type of electronic circuits.

One or more of the components, steps, features and/or functions illustrated herein may be rearranged and/or combined into a single component, step, feature or function or embodied in several components, steps, or functions. Additional elements, components, steps, and/or functions may also be added without departing from novel features disclosed herein. The apparatus, devices, and/or components illustrated herein may be configured to perform one or more of the methods, features, or steps described herein. The novel algorithms described herein may also be efficiently implemented in software and/or embedded in hardware.

It is to be understood that the specific order or hierarchy of steps in the methods disclosed is an illustration of exemplary processes. Based upon design preferences, it is understood that the specific order or hierarchy of steps in the methods may be rearranged. The accompanying method claims present elements of the various steps in a sample order, and are not meant to be limited to the specific order or hierarchy presented unless specifically recited therein.

The previous description is provided to enable any person skilled in the art to practice the various aspects described herein. Various modifications to these aspects will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other aspects. Thus, the claims are not intended to be limited to the aspects shown herein, but are to be accorded the full scope consistent with the language of the claims, wherein reference to an element in the singular is not intended to mean “one and only one” unless specifically so stated, but rather “one or more.” Unless specifically stated otherwise, the term “some” refers to one or more. A phrase referring to “at least one of” a list of items refers to any combination of those items, including single members. As an example, “at least one of: a, b, or c” is intended to cover: a; b; c; a and b; a and c; b and c; and a, b and c. All structural and functional equivalents to the elements of the various aspects described throughout this disclosure that are known or later come to be known to those of ordinary skill in the art are expressly incorporated herein by reference and are intended to be encompassed by the claims. Moreover, nothing disclosed herein is intended to be dedicated to the public regardless of whether such disclosure is explicitly recited in the claims. No claim element is to be construed under the provisions of 35 U.S.C. § 112(f) unless the element is expressly recited using the phrase “means for” or, in the case of a method claim, the element is recited using the phrase “step for.”

These apparatus and methods described in the detailed description and illustrated in the accompanying drawings by various blocks, modules, components, circuits, steps, processes, algorithms, etc. (collectively referred to as “elements”). These elements may be implemented using hardware, software, or combinations thereof. Whether such elements are implemented as hardware or software depends upon the particular application and design constraints imposed on the overall system.

By way of example, an element, or any portion of an element, or any combination of elements may be implemented with a “processing system” that includes one or more processors. Examples of processors include microprocessors, microcontrollers, digital signal processors (DSPs), field programmable gate arrays (FPGAs), programmable logic devices (PLDs), state machines, gated logic, discrete hardware circuits, and other suitable hardware configured to perform the various functionality described throughout this disclosure. One or more processors in the processing system may execute software. Software shall be construed broadly to mean instructions, instruction sets, code, code segments, program code, programs, subprograms, software modules, applications, software applications, software packages, firmware, routines, subroutines, objects, executables, threads of execution, procedures, functions, etc., whether referred to as software, firmware, middleware, microcode, hardware description language, or otherwise.

Accordingly, in one or more exemplary embodiments, the functions described may be implemented in hardware, software, or combinations thereof. If implemented in software, the functions may be stored on or encoded as one or more instructions or code on a computer-readable medium. Computer-readable media includes computer storage media. Storage media may be any available media that can be accessed by a computer. By way of example, and not limitation, such computer-readable media can comprise RAM, ROM, EEPROM, PCM (phase change memory), flash memory, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to carry or store desired program code in the form of instructions or data structures and that can be accessed by a computer. Disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. Combinations of the above should also be included within the scope of computer-readable media. 

What is claimed is:
 1. A method of communicating data between an electronic unit (EU) of a system-on-chip (SoC) and a dynamic random access memory (DRAM), the method comprising: compressing data at the EU by reducing a number of bits utilized to represent the data; encoding the compressed data at the EU to reduce a number of state transitions between bits of the compressed data transferred along a bus; generating control data for decoding the encoded data at the EU; generating metadata corresponding to the compressed data at the EU, the metadata including information used for decompressing the compressed data, the metadata further including the control data; directing storage of the encoded data and the metadata in the DRAM as separate bus transactions along the bus.
 2. The method of claim 1, wherein the control data comprises one or more of an indication of whether bus inversion has been applied to the compressed data and a type of encoding applied to the compressed data.
 3. The method of claim 1, further comprising: directing reading of the encoded data and the metadata from the DRAM, wherein the encoded data remains encoded during transfer between the EU and the DRAM.
 4. The method of claim 1, wherein encoding the compressed data comprises minimizing a hamming distance between bits of the compressed data.
 5. The method of claim 1, wherein directing storage of the encoded data in the DRAM comprises directing a write transaction of the encoded data to the DRAM, the write transaction comprising a plurality of beats, and wherein the control data indicates whether each beat of the plurality of beats is encoded or not.
 6. The method of claim 1, wherein directing storage of the encoded data comprises: converting encoded data to a bus transaction; and sending the bus transaction to a system memory management unit (SMMU) over a bus interface, wherein the SMMU interfaces with a DRAM controller over the bus interface, and wherein the DRAM controller interfaces with the DRAM over the bus interface.
 7. An electronic unit (EU) of a system-on-chip (SoC) comprising: a bandwidth compression component configured to compress data by reducing a number of bits utilized to represent the data; a bus encoding component configured to encode the compressed data and generate control data for decoding the encoded data to reduce a number of state transitions between bits of the compressed data transferred along a bus; and a bus protocol conversion component configured to: generate metadata corresponding to the compressed data, the metadata including information used for decompressing the compressed data, the metadata further including the control data; and direct storage of the encoded data and the metadata in a dynamic random access memory (DRAM) as separate bus transactions along the bus.
 8. The EU of claim 7, wherein the control data comprises one or more of an indication of whether bus inversion has been applied to the compressed data and a type of encoding applied to the compressed data.
 9. The EU of claim 7, wherein the bus protocol conversion component is further configured to: direct reading of the encoded data and the metadata from the DRAM, wherein the encoded data remains encoded during transfer between the EU and the DRAM.
 10. The EU of claim 7, wherein to encode the compressed data comprises to minimize a hamming distance between bits of the compressed data.
 11. The EU of claim 7, wherein to direct storage of the encoded data in the DRAM comprises to direct a write transaction of the encoded data to the DRAM, the write transaction comprising a plurality of beats, and wherein the control data indicates whether each beat of the plurality of beats is encoded or not.
 12. The EU of claim 7, wherein to direct storage of the encoded data comprises to: convert encoded data to a bus transaction; and send the bus transaction to a system memory management unit (SMMU) over a bus interface, wherein the SMMU interfaces with a DRAM controller over the bus interface, and wherein the DRAM controller interfaces with the DRAM over the bus interface.
 13. An electronic unit (EU) of a system-on-chip (SoC) comprising: means for compressing data at the EU by reducing a number of bits utilized to represent the data; means for encoding the compressed data at the EU to reduce a number of state transitions between bits of the compressed data transferred along a bus; means for generating control data for decoding the encoded data at the EU; means for generating metadata corresponding to the compressed data at the EU, the metadata including information used for decompressing the compressed data, the metadata further including the control data; and means for directing storage of the encoded data and the metadata in the DRAM as separate bus transactions along the bus.
 14. The EU of claim 13, wherein the control data comprises one or more of an indication of whether bus inversion has been applied to the compressed data and a type of encoding applied to the compressed data.
 15. The EU of claim 13, further comprising: means for directing reading of the encoded data and the metadata from the DRAM, wherein the encoded data remains encoded during transfer between the EU and the DRAM.
 16. The EU of claim 13, wherein means for encoding the compressed data comprises means for minimizing a hamming distance between bits of the compressed data.
 17. The EU of claim 13, wherein means for directing storage of the encoded data in the DRAM comprises means for directing a write transaction of the encoded data to the DRAM, the write transaction comprising a plurality of beats, and wherein the control data indicates whether each beat of the plurality of beats is encoded or not.
 18. The EU of claim 13, wherein means for directing storage of the encoded data comprises: means for converting encoded data to a bus transaction; and means for sending the bus transaction to a system memory management unit (SMMU) over a bus interface, wherein the SMMU interfaces with a DRAM controller over the bus interface, and wherein the DRAM controller interfaces with the DRAM over the bus interface.
 19. A non-transitory computer readable medium having instructions stored thereon for causing an electronic unit (EU) of a system-on-chip (SoC) to perform a method of communicating data between the EU and a dynamic random access memory (DRAM), the method comprising: compressing data at the EU by reducing a number of bits utilized to represent the data; encoding the compressed data at the EU to reduce a number of state transitions between bits of the compressed data transferred along a bus; generating control data for decoding the encoded data at the EU; generating metadata corresponding to the compressed data at the EU, the metadata including information used for decompressing the compressed data, the metadata further including the control data; and directing storage of the encoded data and the metadata in the DRAM as separate bus transactions along the bus.
 20. The non-transitory computer readable medium of claim 19, wherein the control data comprises one or more of an indication of whether bus inversion has been applied to the compressed data and a type of encoding applied to the compressed data.
 21. The non-transitory computer readable medium of claim 19, wherein the method further comprises: directing reading of the encoded data and the metadata from the DRAM, wherein the encoded data remains encoded during transfer between the EU and the DRAM.
 22. The non-transitory computer readable medium of claim 19, wherein encoding the compressed data comprises minimizing a hamming distance between bits of the compressed data.
 23. The non-transitory computer readable medium of claim 19, wherein directing storage of the encoded data in the DRAM comprises directing a write transaction of the encoded data to the DRAM, the write transaction comprising a plurality of beats, and wherein the control data indicates whether each beat of the plurality of beats is encoded or not.
 24. The non-transitory computer readable medium of claim 19, wherein directing storage of the encoded data comprises: converting encoded data to a bus transaction; and sending the bus transaction to a system memory management unit (SMMU) over a bus interface, wherein the SMMU interfaces with a DRAM controller over the bus interface, and wherein the DRAM controller interfaces with the DRAM over the bus interface. 